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Description 

This invention relates to DNA polymerases suitable for DNA sequencing and in particular relates to a 
method of producing the purified modified polymerases and to a purified modified gene encoding a 
5 modified DNA polymerase. 

DNA sequencing involves the generation of four populations of single stranded DNA fragments having 
one defined terminus and one variable terminus. The variable terminus always terminates at a specific given 
nucleotide base (either guanine (G), adenine (A), thymine (T), or cytosine (C)). The four different sets of 
fragments are each separated on the basis of their length, on a high resolution polyacrylamide gel; each 
70 band on the gel corresponds colinearly to a specific nucleotide in the DNA sequence, thus identifying the 
positions in the sequence of the given nucleotide base. 

Generally there are two methods of DNA sequencing. One method (Maxam and Gilbert sequencing) 
involves the chemical degradation of isolated DNA fragments, each labeled with a single radiolabel at its 
defined terminus, each reaction yielding a limited cleavage specifically at one or more of the four bases (G, 
75 A, T or C). The other method (dideoxy sequencing) involves the enzymatic synthesis of a DNA strand. Four 
separate syntheses are run, each reaction being caused to terminate at a specific base (G t A, T or C) via 
incorporation of the appropriate chain terminating dideoxynucleotide. The latter method is preferred since 
the DNA fragments are uniformly labelled (instead of end labelled) and thus the larger DNA fragments 
contain increasingly more radioactivity. Further, 35 S-labelled nucleotides can be used in place of 32 P- 
20 labelled nucleotides, resulting in sharper definition; and the reaction products are simple to interpret since 
each lane corresponds only to either G, A, T or C. The enzyme used for most dideoxy sequencing is the 
Escherichia coli DNA-polymerase I large fragment ("Klenow"). Another polymerase used is AMV reverse 
transcriptase. 

25 Summary of the Invention 

In one aspect the invention features a method for producing a purified modified DNA polymerase which 
method comprises expressing a modified gene which gene encodes a modified processive DNA poly- 
merase which has sufficient DNA polymerase activity for use in DNA sequencing when said polymerase is 

30 combined with any cofactor necessary for said DNA polymerase activity and which results from the 
modification of a naturally occuring gene modified in that one or more amino acids in the 3* - 5' 
exonuclease domain of said naturally ocurring DNA polymerase are replaced by an amino acid other than 
that naturally occuring at the site of substitution or are deleted so as to reduce the activity of naturally 
occurring 3* - 5' exonuclease activity of the naturally occurring DNA polymerase. 

35 In another aspect the invention features a purified modified gene that encodes a processive modified 
T7-type DNA polymerase which polymerase is able to remain bound to DNA for at least 500 bases under 
conditions normally used for DNA sequencing reactions and which has sufficient DNA polymerase activity 
for use in DNA sequencing when said polymerase is combined with any host factor necessary for said DNA 
polymerase activity and which results from the modification of a naturally occurring gene modified to reduce 

40 the activity of naturally occurring 3'- 5' exonuclease activity of the naturally occurring DNA polymerase 
wherein one or more amino acids of the exonuclease domain within the amino terminal half of the T7 DNA 
polymerase of said naturally occurring DNA polymerase, or the corresponding domain of other T7-type 
DNA polymerases, are replaced by an amino acid other than that naturally occurring at the site of 
substitution or are deleted. 

45 In a preferred embodiment one or more of the amino acids of the exonuclease domain from the amino 
terminal to amino acid residue 224 of T7 DNA polymerase of said naturally occurring DNA polymerase, or 
the corresponding domain of other T7-type DNA polymerases, are replaced by an amino acid other than 
that naturally occurring at the site of substitution or are deleted. 

In preferred embodiments of the above mentioned aspects of the invention the modified polymerase 

so encoded by the gene is able to remain bound to a DNA molecule for at least 500 bases under conditions 
normally used for DNA sequencing before dissociating, most preferably for at least 1,000 bases; the 
polymerase activity of the modified polymerase is at least 90% of that of the naturally ocurring DNA 
polymerase; the gene encodes a modified polymerase which is substantially the same as one in cells 
infected with a T7-type phage (i.e., phage in which the DNA polymerase requires host thioredoxin as a 

55 subunit) for example, the T7-type phage is T7, T3, *l, *ll, H, W31, gh-l, Y, AII22, or Sp6; the polymerase is 
non-discriminating for dideoxy nucleotide analogs; the polymerase has a 3' - 5* exonuclease activity at least 
50% lower than the naturally-occurring exonuclease activity of naturally occurring T7-type DNA polymerase; 
the polymerase is modified to reduce the activity of the naturally occurring 3* - 5 1 exonuclease activity to 
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less than 500 units per mg of polymerase, more preferably less than 1 unit, even more preferably less than 
0.1 unit, and most preferably has no detectable exonuclease activity; the polymerase is able to utilize 
primers of as short as 10 bases or preferably as short as 4 bases; the primer comprises four to forty 
nucleotide bases, and is single stranded DNA or RNA. Preferably the modified gene encodes a modified 

5 DNA polymerase in which a naturally occurring His residue of the naturally occurring DNA polymerase is 
replaced or deleted. In embodiments described herein after His 123, Ser 122 and His 123, Lys 118 to His 
123, Lys 118 and Arg 119 or Arg 131, Lys 136, Lys 140, Lys 144 and Arg 145 of the naturally occurring T7 
DNA Polymerase are replaced or deleted. 

This invention provides a DNA polymerase which is processive, non-discriminating, and can utilize short 

w primers. Further, the polymerase has no associated exonuclease activity. These are ideal properties for the 
above described methods, and in particular for DNA sequencing reactions, since the background level of 
radioactivity in the polyacylamide gels is negligible, there are few or no artifactual bands, and the bands are 
sharp - making the DNA sequence easy to read. Further, such a polymerase allows novel methods of 
sequencing long DNA fragments, as is described in detail below. 

75 Other features and advantages of the invention will be apparent from the following description of the 
preferred embodiments thereof and from the claims. 

Description of the Preferred Embodiments 

20 The drawings will first briefly be described. 

Drawings 

Figs. 1-3 are diagrammatic representations of the vectors pTrx-2, mGP1-1, and pGP5-5 respectively; 
25 Fig. 4 is a graphical representation of the selective oxidation of T7 DNA polymerase; 

Fig. 5 is a graphical representation of the ability of modified 17 polymerase to synthesize DNA in the 
presence of etheno-dATP; and 

Fig. 6 is a diagrammatic representation of the enzymatic amplification of genomic DNA using modified 
T7 DNA polymerase. 

30 Fig. 7, 8 and 9 are the nucleotide sequences of pTrx-2, a part of pGP5-5 and mGP1-2 respectively. 
Fig. 10 is a diagrammatic representation of pGP5-6. 

DNA Polymerase 

35 in general the DNA polymerase of this invention is processive, has no associated exonuclease activity, 
does hot discriminate against nucleotide analog incorporation, and can utilize small oligonucleotides (such 
as tetramers, hexamers and octamers) as specific primers. These properties will now be discussed in detail. 

Processivity 

AO 

By processivity is meant that the DNA polymerase is able to continuously incorporate many nucleotides 
using the same primer-template without dissociating from the template, under conditions normally used for 
DNA sequencing extension reactions. The degree of processivity varies with different polymerases: some 
incorporate only a few bases before dissociating (e.g. Klenow (about 15 bases), T4 DNA polymerase (about 

45 10 bases), T5 DNA polymerase (about 180 bases) and reverse transcriptase (about 200 bases) (Das et al. J. 
Biol. Chem. 254:1227 1979; Bambara et al., J. Biol. Chem 253:413, 1978) while others, such as those of the 
present invention, will remain bound for at least 500 bases and preferably at least 1,000 bases under 
suitable environmental conditions. Such environmental conditions include having adequate supplies of all 
four deoxynucleoside triphosphates and an incubation temperature from 10°C-50*C. Processivity is greatly 

so enhanced in the presence of E. coli single stranded binding (ssb), protein. 

With processive enzymes termination of a sequencing reaction will occur only at those bases which 
have incorporated a chain terminating agent, such as a dideoxynucleotide. If the DNA polymerase is non- 
processive, then artifactual bands will arise during sequencing reactions, at positions corresponding to the 
nucleotide where the polymerase dissociated. Frequent dissociation creates a background of bands at 

55 incorrect positions and obscures the true DNA sequence. This problem is partially corrected by incubating 
the reaction mixture for a long time (30-60 min) with a high concentration of substrates, which "chase" the 
artifactual bands up to a high molecular weight at the top of the gel, away from the region where the DNA 
sequence is read. This is not an ideal solution since a non-processive DNA polymerase has a high 
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probability of dissociating from the template at regions of compact secondary structure, or hairpins. 
Reinitiation of primer elongation at these sites is inefficient and the usual result is the formation of bands at 
the same position for all four nucleotides, thus obscuring the DNA sequence. 

5 Analog discrimation 

The DNA polymerases of this invention do not discriminate significantly between dideoxy-nucleotide 
analogs and normal nucleotides. That is, the chance of incorporation of an analog is approximately the 
same as that of a normal nucleotide or at least incorporates the analog with at least 1/10 the efficiency that 

to of a normal analog. The polymerases of this invention also do not discriminate significantly against some 
other analogs. This is important since, in addition to the four normal deoxynucleoside triphosphates (dGTP, 
dATP, dTTP and dCTP), sequencing reactions require the incorporation of other types of nucleotide 
derivatives such as: radioactively-or fluorescently-labelled nucleoside triphosphates, usually for labeling the 
synthesized strands with 35 S, ^P, or other chemical agents. When a DNA polymerase does not discriminate 

75 against analogs the same probability will exist for the incorporation of an analog as for a normal nucleotide. 
For labelled nucleoside triphosphates this is important in order to efficiently label the synthesized DNA 
strands using a minimum of radioactivity. Further, lower levels of analogs are required with such enzymes, 
making the sequencing reaction cheaper than with a discriminating enzyme. 

Discriminating polymerases show a different extent of discrimination when they are polymerizing in a 

20 processive mode versus when stalled, struggling to synthesize through a secondary structure impediment. 
At such impediments there will be a variability in the intensity of different radioactive bands on the gel, 
which may obscure the sequence. 

Exonuclease Activity 

25 

The DNA polymerase of the invention has less than 50%, preferably less than 1%, and most preferably 
less than 0.1%, of the normal or naturally associated level of exonuclease activity (amount of activity per 
polymerase molecule). By normal or naturally associated level is meant the exonuclease activity of 
unmodified T7-type polymerase. Normally the associated activity is about 5,000 units of exonuclease 

30 activity per mg of polymerase, measured as described below by a modification of the procedure of Chase 
et al. (249 J. Biol. Chem. 4545, 1974). Exonucleases increase the fidelity of DNA synthesis by excising any 
newly synthesized bases which are incorrectly basepaired to the template. Such associated exonuclease 
activities are detrimental to the quality of DNA sequencing reactions. They raise the minimal required 
concentration of nucleotide precursors which must be added to the reaction since, when the nucleotide 

35 concentration falls, the polymerase activity slows to a rate comparable with the exonuclease activity, 
resulting in no net DNA synthesis, or even degradation of the synthesized DNA. 

More importantly, associated exonuclease activity will cause a DNA polymerase to idle at regions in the 
template with secondary structure impediments. When a polymerase approaches such a structure its rate of 
synthesis decreases as it struggles to pass. An associated exonuclease will excise the newly synthesized 

40 DNA when the polymerase stalls. As a consequence numerous cycles of synthesis and excision will occur. 
This may result in the polymerase eventually synthesizing past the hairpin (with no detriment to the quality 
of the sequencing reaction); or the polymerase may dissociate from the synthesized strand (resulting in an 
artifactual band at the same position in all four sequencing reactions); or, a chain terminating agent may be 
incorporated at a high frequency and produce a wide variability in the intensity of different fragments in a 

45 sequencing gel. This happens because the frequency of incorporation of a chain terminating agent at any 
given site increases with the number of opportunities the polymerase has to incorporate the chain 
terminating nucleotide, and so the DNA polymerase will incorporate a chain-terminating agent at a much 
higher frequency at sites of idling than at other sites. 

An ideal sequencing reaction will produce bands of uniform intensity throughout the gel. This is 

50 essential for obtaining the optimal exposure of the X-ray film for every radioactive fragment. If there is 
variable intensity of radioactive bands, then fainter bands have a chance of going undetected. To obtain 
uniform radioactive intensity of all fragments, the DNA polymerase should spend the same interval of time 
at each position on the DNA, showing no preference for either the additon or removal of nucleotides at any 
given site. This occurs if the DNA polymerase lacks any associated exonuclease, so that it will have only 

55 one opportunity to incorporate a chain terminating nucleotide at each position along the template. 
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Short primers 

The DNA polymerase of the invention is able to utilize primers of 10 bases or less, as well as longer 
ones, most preferably of 4-20 bases. The ability to utilize short primers offers a number of important 

5 advantages to DNA sequencing. The shorter primers are cheaper to buy and easier to synthesize than the 
usual 15-20-mer primers. They also anneal faster to complementary sites on a DNA template, thus making 
the sequencing reaction faster. Further, the ability to utilize small (e.g., six or seven base) oligonucleotide 
primers for DNA sequencing permits strategies not otherwise possible for sequencing long DNA fragments. 
For example, a kit containing 80 random hexamers could be generated, none of which are complementary 

w to any sites in the cloning vector. Statistically, one of the 80 hexamer sequences will occur an average of 
every 50 bases along the DNA fragment to be sequenced. The determination of a sequence of 3000 bases 
would require only five sequencing cycles. First, a "universal" primer (e.g.. New England Biolabs #1211, 
sequence 5' GTAAAACGACGGCCAGT 3') would be used to sequence about 600 bases at one end of the 
insert. Using the results from this sequencing reaction, a new primer would be picked from the kit 

75 homologous to a region near the end of the determined sequence. In the second cycle, the sequence of the 
next 600 bases would be determined using this primer. Repetition of this process five times would 
determine the complete sequence of the 3000 bases, without necessitating any subcloning, and without the 
chemical synthesis of any new oligonucleotide primers. The use of such short primers may be enhanced by 
including gene 2.5 and 4 protein of 17 in the sequencing reaction. 

20 DNA polymerases of this invention, (i.e., having the above properties) include modified T7-type 
polymerases. That is the DNA polymerase requires host thioredoxin as a sub-unit, and they are substan- 
tially identical to a modified T7 DNA polymerase or to equivalent enzymes isolated from related phage, 
such as T3, <f>\, 4>\\, H, W31, gh-1, Y, A1122 and SP6. Each of these enzymes can be modified to have 
properties similar to those of the modified T7 enzyme. It is possible to isolate the enzyme from phage 

25 infected cells directly, but preferably the enzyme is isolated from cells which overproduce it By substan- 
tially identical is meant that the enzyme may have amino acid substitutions which do not affect the overall 
properties of the enzyme. One example of a particularly desirable amino acid substitution is one in which 
the natural enzyme is modified to remove any exonuclease activity. This modification may be performed at 
the genetic or chemical level (see below). 

30 

Cloning T7 polymerase 

As an example of the invention we shall describe the cloning, overproduction, purification, modification 
and use of T7 DNA polymerase. This processive enzyme consists of two polypeptides tightly complexed in 

35 a one to one stoichiometry. One is the phage T7-encoded gene 5 protein of 84,000 daltons (Modrich et al. 
150 J. Biol. Chem. 5515, 1975), the other is the E. cqli encoded thioredoxin, of 12,000 daltons (Tabor et al., 
J. Biol, Chem. 262 :16, 216, 1987). The thioredoxin is an accessory protein and attaches the gene 5 protein 
(the non-processive actual DNA polymerase) to the primer template. The natural DNA polymerase has a 
very active 3' to 5' exonuclease associated with it. This activity makes the polymerase useless for DNA 

40 sequencing and must be inactivated or modified before the polymerase can be used. This is readily 
performed, as described below, either chemically, by local oxidation of the exonuclease domain, or 
genetically, by modifying the coding region of the polymerase gene encoding this activity. 

pTrx-2 

45 

In order to clone the trxA (thioredoxin) gene of E. colt wild type E. coli DNA was partially cleaved with 
Sau 3A and the fragments ligated to Bam HI-cleaved T7 DNA isolated from strain T7 ST9 (Tabor et al., in 
Thioredoxin and Glutaredoxin Systems: Structure and Function (Holmgren et al., eds) pp. 285-300, Raven 
Press, NY; and Tabor et al., supra ). The ligated DNA was transfected into E. coli trxA" cells, the mixture 

50 plated onto trxA" cells, and the resulting T7 plaques picked. Since T7 cannot grow without an active E. coli 
trxA gene only those phages containing the trxA gene could form plaques. The cloned trxA genes were 
located on a 470 base pair Hindi fragment. 

In order to overproduce thioreodoxin a plasmid, pTrx-2, was as constructed. Briefly, the 470 base pair 
Hindi fragment containing the trxA gene was isolated by standard procedure (Maniatis et al., Cloning: A 

55 Laboratory Manual, Cold Spring Harbor Labs., Cold Spring Harbor, N.Y.), and ligated to a derivative of 
pBR322 containing a Ptac promoter (ptac-12, Amann et al., 25 Gene 167, 1983). Referring to Fig. 2, ptac- 
12, containing ^-lactamase and Col El origin, was cut with Pvull, to yield a fragment of 2290 bp, which was 
then ligated to two tandem copies of trxA (Hindi fragment) using commercially available linkers (Smal- 
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Bam HI polylinker), to form pTrx-2. The complete nucleotide sequence of pTrx-2 is shown in Figure 7. 
Thioredoxin production is now under the control of the tec promoter, and thus can be specifically induced, 
e.g. by IPTG (isopropyl 0-D-thiogalactoside). 

5 pGP5-5 and mGP1-2 

Some gene products of T7 are lethal when expressed in E. coli. An expression system was developed 

to facilitate cloning and expression of, lethal genes, based on the inducible expression of 17 RNA 

polymerase. Gene 5 protein is lethal in some E. coli strains and an example of such a system is described 
ro by Tabor et al. 82 Proc. Nat. Acad. Sci. 1074 (1985) where T7 gene 5 was placed under the control of the 

$10 promoter, and is only expressed when T7 RNA polymerase is present in the cell. 

Briefly, pGP5-5 (Fig. 3) was constructed by standard procedures using synthetic Bam HI linkers to join 

T7 fragment from 14306 (Nde l) to 16869 (Ahalll), containing gene 5, to the 560 bp fragment of T7 from 

5667 (Hindi) to 6166 (Fnu4H1) containing both the $1.1 A and $1.1 B promoters, which are recognized by 
75 T7 RNA polymerase, and the 3kb Bam HI-Hincll fragment of pACYC177 (Chang et al., 134 J. Bacteriol. 

1141, 1978). The nucleotide sequence of the T7 inserts and linkers in shown in Fig. 8. In this plasmid gene 

5 is only expressed when T7 RNA polymerase is provided in the cell. 

Referring to Fig. 3, T7 RNA polymerase is provided on phage vector mGP1-2. This is similar to pGP1-2 

(Tabor et al., id.) except that the fragment of T7 from 3133 (Haelll) to 5840 (Hinfl), containing T7 RNA 
20 polymerase was ligated, using linkers (Bglll and Sail respectively), to Bam HI-Sall cut M13 mp8, placing the 

polymerase gene under control of the lac promoter. The complete nucleotide sequence of mGP1-2 is 

shown in Fig. 9. 

Since pGP5-5 and pTrx-2 have different origins of replication (respectively a P15A and a ColE1 origin) 
they can be tranformed into one cell simultaneously. pTrx-2 expresses large quantities of thioredoxin in the 
25 presence of IPTG. mGP1-2 can coexist in the same cell as these two plasmids and be used to regulate 
expression of T7-DNA polymerase from pGP5-5, simply by causing production of T7-RNA polymerase by 
inducing the tec promoter with, e.g., IPTG. 

Overproduction of T7 DNA polymerase 

30 

There are several potential strategies for overproducing and reconstituting the two gene products of 
trxA and gene 5. The same cell strains and plasmids can be utilized for all the strategies. In the preferred 
strategy the two genes are co-overexpressed in the same cell. (This is because gene 5 is susceptible to 
proteases until thioredoxin is bound to it.) As described in detail below, one procedure is to place the two 

35 genes separately on each of two compatible plasmids in the same cell. Alternatively, the two genes could 
be placed in tandem on the same plasmid. It is important that the T7-gene 5 is placed under the control of 
a non-leaky inducible promoter, such as $1.1 A, $1.1 B and $10 of T7, as the synthesis of even small 
quantities of the two polypeptides together is toxic in most E. coli cells. By non-leaky is meant that less 
than 500 molecules of the gene product are produced, per cell generation time, from the gene when the 

40 promoter, controlling the gene's expression, is not activated. Preferably the T7 RNA polymerase expression 
system is used although other expression systems which utilize inducible promoters could also be used. A 
leaky promoter, e.g., plac, allows more than 500 molecules of protein to be synthesized, even when not 
induced, thus cells containing lethal genes under the control of such a promoter grow poorly and are not 
suitable in this invention. It is of course possible to produce these products in cells where they are not 

45 lethal, for example, the plac promoter is suitable in such cells. 

In a second strategy each gene can be cloned and overexpressed separately. Using this strategy, the 
cells containing the individually overproduced polypeptides are combined prior to preparing the extracts, at 
which point the two polypeptides form an active T7 DNA polymerase. 

50 Example 1 : Production of T7 DNA polymerase 

E. coli strain 71.18 (Messing et al., Proc. Nat. Acad. Sci. 74:3642, 1977) is used for preparing stocks of 
mGP1-2. 71.18 is stored in 50% glycerol at -80 *C. and is streaked on a standard minimal media agar plate. 
A single colony is grown overnight in 25 mi standard M9 media at 37 *C, and a single plaque of mGP1-2 is 
55 obtained by titering the stock using freshly prepared 71.18 cells. The plaque is used to inoculate 10 ml 2X 
LB (2% Bacto-Tryptone, 1% yeast extract, 0.5% NaCI, 8mM NaOH) containing JM103 grown to an 
As3o=0.5. This culture will provide the phage stock for preparing a large culture of mGP1-2. After 3-12 
hours, the 10 ml culture is centrifuged, and the supernatant used to infect the large (21) culture. For the 
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large culture, 4 X 500 ml 2X LB is inoculated with 4 X 5 ml 71.18 cells grown in M9, and is shaken at 37*0. 
When the large culture of cells has grown to an As9t> = 1.0 (approximately three hours), they are inoculated 
with 10 ml of supernatant containing the starter lysate of mGP1-2. The infected cells are then grown 
overnight at 37 *C. The next day, the cells are removed by centrifugation, and the supernatant is ready to 

5 use for induction of K38/pGP5-5/pTrx-2 (see below). The supernatant can be stored at 4*C for approxi- 
mately six months, at a titer -5 X 10 11 <f>/ml. At this titer, 1 L of phage will infect 12 liters of cells at an 
Asso =5 with a multiplicity of infection of 15. If the titer is low, the mGP1-2 phage can be concentrated from 
the supernatant by dissolving NaCI (60 gm/liter) and PEG-6000 (65 gm/liter) in the supernatant, allowing the 
mixture to settle at 0*C for 1-72 hours, and then centrifuging (7000 rpm for 20 min). The precipitate, which 

70 contains the mGP1-2 phage, is resuspended in approximately 1/20th of the original volume of M9 media. 

K38/pGP5-5/pTrx-2 is the E. coli strain (genotype HfrC (X)) containing the two compatible plasmids 
pGP5-5 and pTrx-2. pGP5-5 plasmid has a P15A origin of replication and expresses the kanamycin (Km) 
resistance gene. pTrx-2 has a ColEI origin of replication and expresses the ampicillin (Ap) resistance gene. 
The plasmids are introduced into K38 by standard procedures, selecting Km R and Ap R respectively. The 

75 cells K38/pGP5-5/pTrx-2 are stored in 50% glycerol at -80 • C. Prior to use they are streaked on a plate 
containing 50ug/ml ampicillin and kanamycin, grown at 37 *C overnight, and a single colony grown in 10 ml 
LB media containing 50ug/ml ampicillin and kanamycin, at 37 *C for 4-6 hours. The 10 ml cell culture is 
used to inoculate 500 ml of LB media containing 50ug/ml ampicillin and kanamycin and shaken at 37 * C 
overnight. The following day, the 500 ml culture is used to inoculate 12 liters of 2X LB-KPO4 media (2% 

20 Bacto-Tryptone, 1% yeast extract, 0.5% NaCI, 20 mM KPO*, 0.2% dextrose, and 0.2% casamino acids, pH 
7.4), and grown with aeration in a fermentor at 37 *C. When the cells reach an A590 =5.0 (i.e. logarithmic or 
stationary phase cells), they are infected with mGP1-2 at a multiplicity of infection of 10, and IPTG is added 
(final concentration 0.5mM). The IPTG induces production of thioredoxin and the T7 RNA polymerase in 
mGP1-2, and thence induces production of the cloned DNA polymerase. The cells are grown for an 

25 additional 2.5 hours with stirring and aeration, and then harvested. The cell pellet is resuspended in 1.5 L 
10% sucrose/20 mM Tris-HCI, pH 8.0/25 mM EDTA and re-spun. Finally, the cell pellet is resuspended in 
200 ml 10% sucrose/20 mM Tris-HCI, pH 8/1.0 mM EDTA, and frozen in liquid N 2 . From 12 liters of 
induced cells 70 gm of cell paste are obtained containing approximately 700 mg gene 5 protein and 100 
mg thioredoxin. 

30 K38/pTrx-2 (K38 containing pTrx-2 alone) overproduces thioredoxin, and it is added as a "booster" to 
extracts of K38/pGP5-5/pTrx-2 to insure that thioredoxin is in excess over gene 5 protein at the outset of the 
purification. The K38/pTrx-2 cells are stored in 50% glycerol at -80 - C. Prior to use they are streaked on a 
plate containing 50 ug/ml ampicillin, grown at 37 'C for 24 hours, and a single colony grown at 37 *C 
overnight in 25 ml LB media containing 50 ug/ml ampicillin. The 25 ml culture is used to inoculate 2 L of 2X 

35 LB media and shaken at 37 °C. When the cells reach an A590 =3.0, the ptac promoter, and thus thioredoxin 
production, is induced by the addition of IPTG (final concentration 0.5 mM). The cells are grown with 
shaking for an additional 12-16 hours at 37 W C, harvested, resuspended in 600 ml 10% sucrose/20 mM Tris- 
HCI, pH 8.0/25 mM EDTA, and re-spun. Finally, the cells are resuspended in 40 ml 10% sucrose/20 mM 
Tris-HCI, pH 8/0.5 mM EDTA, and frozen in liquid N 2 . From 2L of cells 16 gm of cell paste are obtained 

40 containing 150 mg of thioredoxin. 

Assays for the polymerase involve the use of single-stranded calf thymus DNA (6mM) as a substrate. 
This is prepared immediately prior to use by denaturation of double-stranded calf thymus DNA with 50 mM 
NaOH at 20 *C for 15 min., followed by neutralization with HCI. Any purified DNA can be used as a 
template for the polymerase assay, although preferably it will have a length greater than 1 ,000 bases. 

45 The standard T7 DNA polymerase assay used is a modification of the procedure described by Grippo 
et al. (246 J. Biol. Chem. 6867, 1971). The standard reaction mix (200 ul final volume) contains 40 mM 
Tris/HCI pH 7.5, 10 mM MgCI 2 , 5 mM dithiothreitol, 100 nmol alkali-denatured calf thymus DNA, 0.3 dGTP, 
dATP, dCTP and pHJdTTP (20 cpm/pm), 50 ug/ml BSA, and varying amounts of T7 DNA polymerase. 
Incubation is at 37 *C (10*C-45*C) for 30 min (5 min-60 min). The reaction is stopped by the addition of 3 

50 ml of cold (0 • C) 1 N HCI-0.1 M pyrophosphate. Acid-insoluble radioactivity is determined by the procedure 
of Hinkle et al. (250 J. Biol. Chem. 5523, 1974). The DNA is precipitated on ice for 15 min (5 min-12 hr), 
then precipitated onto glass-fiber filters by filtration. The filters are washed five times with 4 ml of cold 
(0'C) 0.1 M HCI-0.1 M pyrophosphate, and twice with cold (0*C) 90% ethanol. After drying, the radioactivity 
on the filters is counted using a non-aqueous scintillation fluor. 

55 One unit of polymerase activity catalyzes the incorporation of 10 nmol of total nucleotide into an acid- 
soluble form in 30 min at 37 • C. under the conditions given above. Native T7 DNA polymerase and modified 
T7 DNA polymerase (see below) have the same specific polymerase activity ± 20%, which ranges between 
5,000-20,000 units/mg for native and 5,000-50,000 units/mg for modified polymerase) depending upon the 
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preparation, using the standard assay conditions stated above. 

T7 DNA polymerase is purified from the above extracts by precipitation and chromatography tech- 
niques. An example of such a purification follows. 

An extract of frozen cells (200 ml K38/pGP5-5/pTrx-2 and 40 ml K38/pTrx-2) are thawed at 0*C 
5 overnight. The cells are combined, and 5 ml of lysozyme (15 mg/ml) and 10 ml of NaCI (5M) are added. 
After 45 min at 0 * C, the cells are placed in a 37 • C water bath until their temperature reaches 20 • C. The 
cells are then frozen in liquid N 2 . An additional 50 ml of NaCI (5M) is added, and the cells are thawed in a 
37 'C water bath. After thawing, the cells are gently mixed at 0"C for 60 min. The lysate is centrifuged for 
one hr at 35,000 rpm in a Beckman 45Ti rotor. The supernatant (250 ml) is fraction I. It contains 
10 approximately 700 mg gene 5 protein and 250 mg of thioredoxin (a 2:1 ratio thioredoxin to gene 5 protein). 

90 gm of ammonium sulphate is dissolved in fraction I (250 ml) and stirred for 60 min. The suspension 
is allowed to sit for 60 min, and the resulting precipitate collected by centrifugation at 8000 rpm for 60 min. 
The precipitate is redissolved in 300 ml of 20 mM Tris-HCI pH 7.5/5 mM 2-mercaptoethanol/0.1 mM 
EDTA/10% glycerol (Buffer A). This is fraction II. 
15 A column of Whatman DE52 DEAE (12.6 cm 2 x 18 cm) is prepared and washed with Buffer A. Fraction 
II is dialyzed overnight against two changes of 1 L of Buffer A each until the conductivity of Fraction II has a 
conductivity equal to that of Buffer A containing 100 mM NaCI. Dialyzed Fraction II is applied to the column 
at a flow rate of 100 ml/hr, and washed with 400 ml of Buffer A containing 100 NaCI. Proteins are eluted 
with a 3.5 L gradient from 100 to 400 mM NaCI in Buffer A at a flow rate of 60 ml/hr. Fractions containing 
20 T7 DNA polymerase, which elutes at 200 mM NaCI, are pooled. This is fraction III (190 ml). 

A column of Whatman P11 phosphocellulose (12.6 cm 2 x 11 cm) is prepared and washed with 20 mM 
KPO* pH 7.4/5 mM 2-mercaptoethanol/0.1 mM EDTA/10 % glycerol (Buffer B). Fraction III is diluted 2-fold 
(380 ml) with Buffer B. then applied to the column at a flow rate of 60 ml/hr and washed with 200 ml of 
Buffer B containing 100mM KCi. Proteins are eluted with a 1.8 L gradient from 100 to 400 mM KCI in Buffer 
25 B at a flow rate of 60 ml/hr. Fractions containing T7 DNA polymerase which elutes at 300 KCI, are pooled. 
This is fraction IV (370 ml). 

A column of DEAE-Sephadex A-50 (4.9 cm 2 x 15 cm) is prepared and washed with 20 mM Tris-HCI 
7.0/0.1 mM dithiothreitol/0.1 mM EDTA/10% glycerol (Buffer C). Fraction IV is dialyzed against two changes 
of 1 L Buffer C to a final conductivity equal to that of Buffer C containing 100 mM NaCI. Dialyzed fraction IV 
30 is applied to the column at a flow rate of 40 ml/hr, and washed with 150 ml of Buffer C containing 100 mM 
NaCI. Proteins are eluted with a 1 L gradient from 100 to 300 mM NaCI in Buffer C at a flow rate of 40 
ml/hr. Fractions containing T7 DNA polymerase, which elutes at 210 mM NaCI, are pooled. This is fraction 
V (120 ml). 

A column of BioRad HTP hydroxylapatite (4.9 cm 2 x 15 cm) is prepared and washed with 20 mM KPCU, 
35 pH 7.4/10 mM 2-mercaptoethanol/2 mM Na citrate/10% glycerol (Buffer D). Fraction V is dialyzed against 
two changes of 500 ml Buffer D each. Dialyzed fraction V is applied to the column at a flow rate of 30 ml/hr, 
and washed with 100 ml of Buffer D. Proteins are eluted with a 900 ml gradient from 0 to 180 mM KPO+, 
pH 7.4 in Buffer D at a flow rate of 30 ml/hr. Fractions containing T7 DNA polymerases which elutes at 50 
mM KPO*, are pooled. This is fraction VI (130 ml). It contains 270 mg of homogeneous T7 DNA 
40 polymerase. 

Fraction VI is dialyzed versus 20 mM KPO* pH 7.4/0.1 mM dithiothreitol/0.1 mM EDTA/50% glycerol. 
This is concentrated fraction VI (-65 ml, 4 mg/ml), and is stored at -20 "C. 

The isolated T7 polymerase has exonuclease activity associated with it. As stated above this must be 
inactivated. An example of inactivation by chemical modification follows. 
45 Concentrated fraction VI is dialyzed overnight against 20 mM KPO4 pH 7.4/0.1 mM dithiothreitol/10% 
glycerol to remove the EDTA present in the storage buffer. After dialysis, the concentration is adjusted to 2 
mg/ml with 20 mM KPO* pH 7.4/0.1 mM dithiothreitol/10% glycerol, and 30 ml (2mg/ml) aliquots are placed 
in 50 ml polypropylene tubes. (At 2 mg/ml, the molar concentration of T7 DNA polymerase is 22 uM.) 

Dithiothreitol (DTT) and ferrous ammonium sulfate (Fe(NH 4 )2(S04)26H2 0) are prepared fresh imme- 
50 diately before use, and added to a 30 ml aliquot of T7 DNA polymerase, to concentrations of 5 mM DTT 
(0.6 ml of a 250 stock) and 20uM FefNHtMSO^ehfeO (0.6 ml of a 1 mM stock). 

During modification the molar concentrations of T7 DNA polymerase and iron are each approximately 20 
UM, while DTT is in 250X molar excess. 

The modification is carried out at 0 • C under a saturated oxygen atmosphere as follows. The reaction 
55 mixture is placed on ice within a dessicator, the dessicator is purged of air by evacuation and subsequently 
filled with 100% oxygen. This cycle is repeated three times. The reaction can be performed in air (20% 
oxygen), but occurs at one third the rate. 
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The time course of loss of exonuclease activity is shown in Fig. 4. 3 H-Iabeled double-stranded DNA (6 
cpm/pmol) was prepared from bacteriophage T7 as described by Richardson (15 J. Molec. Biol. 49, 1966). 
3 H-labeled single-stranded 17 DNA was prepared immediately prior to use by denaturation of double- 
stranded 3 H-labeled T7 DNA with 50 mM NaOH at 20 *C for 15 min, followed by neutralization with HCI. 

5 The standard exonuclease assay used is a modification of the procedure described by Chase et al. (supra ). 
The standard reaction mixture (100 ul final volume) contained 40 mM Tris/HCI pH 7.5, 10 mM MgCI 2 , 10 
mM dithiothreitol, 60 nmol 3 H-labeled single-stranded 17 DNA (6 cpm/pm), and varying amounts of 17 DNA 
polymerase. 3 H-labeled double-stranded 17 DNA can also be used as a substrate. Also, any uniformly 
radioactively labeled DNA, single- or double-stranded, can be used for the assay. Also, 3' end labeled 

io single-or double-stranded DNA can be used for the assay. After incubation at 37 'C for 15 min, the reaction 
is stopped by the addition of 30 ul of BSA (10mg/ml) and 25 ul of TCA (100% w/v). The assay can be run 
at 10*C-45*C for 1-60 min. The DNA is precipitated on ice for 15 min (1 min - 12 hr), then centrifuged at 
12,000 g for 30 min (5 min - 3 hr). 100 ul of the supernatant is used to determine the acid-soluble 
radioactivity by adding it to 400 ul water and 5 ml of aqueous scintillation cocktail. 

75 One unit of exonuclease activity catalyzes the acid solubilization of 10 nmol of total nucleotide in 30 min 
under the conditions of the assay. Native T7 DNA polymerase has a specific exonuclease activity of 5000 
units/mg, using the standard assay conditions stated above. The specific exonuclease activity of the 
modified T7 DNA polymerase depends upon the extent of chemical modification, but ideally is at least 10- 
100-fold lower than that of native T7 DNA polymerase, or 500 to 50 or less units/mg using the standard 

20 assay conditions stated above. When double stranded substrate is used the exonuclease activity is about 7- 
fold higher. 

Under the conditions outlined, the exonuclease activity decays exponentially, with a half-life of decay of 
eight hours. Once per day the reaction vessel is mixed to distribute the soluble oxygen, otherwise the 
reaction will proceed more rapidly at the surface where the concentration of oxygen is higher. Once per day 
25 2.5 mM DTT (0.3 ml of a fresh 250 mM stock to a 30 ml reaction) is added to replenish the oxidized DTT. 

After eight hours, the exonuclease activity of T7 DNA polymerase has been reduced 50%, with 
, negligible loss of polymerase activity. The 50% loss may be the result of the complete inactivation of 
exonuclease" activity of half the polymerase molecules, rather than a general reduction of the rate of 
exonuclease activity in all the molecules. Thus, after an eight hour reaction all the molecules have normal 
30 polymerase activity, half the molecules have normal exonuclease activity, while the other half have <0.1% 
of their original exonuclease activity. 

When 50% of the molecules are modified (an eight hour reaction), the enzyme is suitable, although 
suboptimal, for DNA sequencing. For more optimum quality of DNA sequencing, the reaction is allowed to 
proceed to greater than 99% modification (having less than 50 units of exonuclease activity), which requires 
35 four days. 

After four days, the reaction mixture is dialyzed against 2 changes of 250 ml of 20 mM KPO* pH 7.4/0.1 
mM dithiothreitol/0.1 mM EDTA/50% glycerol to remove the iron. The modified T7 DNA polymerase (-4 
mg/ml) is stored at -20 • C. 

The reaction mechanism for chemical modification of T7 DNA polymerase depends upon reactive 
40 oxygen species generated by the presence of reduced transition metals such as Fe 2 * and oxygen. A 
possible reaction mechanism for the generation of hydroxy I radicals is outlined below: 

(1) Fe2 + + O2 - Fe 3+ + 0' 2 

45 (2) 2 0 2 + 2 H + - H2O2 + O2 

(3) Fe* + + H2O2 - FE 3 * + OH* + OH~ 

In equation 1, oxidation of the reduced metal ion yields superoxide radical, 0 2 . The superoxide radical 
50 can undergo a dismutation reaction, producing hydrogen peroxide (equation 2). Finally, hydrogen peroxide 
can react with reduced metal ions to form hydroxyl radicals, OH* (the Fenton reaction, equation 3). The 
oxidized metal ion is recycled to the reduced form by reducing agents such as dithiothreitol (DTT). 

These reactive oxygen species probably inactivate proteins by irreversibly chemically altering specific 
amino acid residues. Such damage is observed in SDS-PAGE of fragments of gene 5 produced by CNBr or 
55 trypsin. Some fragments disappear, high molecular weight cross linking occurs, and some fragments are 
broken into two smaller fragments. 

As previously mentioned, oxygen, a reducing agent (e.g. DTT, 2-mercaptoethanol) and a transition 
metal (e.g. iron) are essential elements of the modification reaction. The reaction occurs in air, but is 
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stimulated three-fold by use of 100% oxygen. The reaction will occur slowly in the absence of added 
transition metals due to the presence of trace quantities of transition metals (1-2uM) in most buffer 
preparations. 

As expected, inhibitors of the modification reaction include anaerobic conditions (e.g., N 2 ) and metal 

5 chelators (e.g. EDTA, citrate, nitrilotriacetate). In addition, the enzymes catalase and superoxide dismutase 
may inhibit the reaction, consistent with the essential role of reactive oxygen species in the generation of 
modified T7 DNA polymerase. 

As an alternative procedure, it is possible to genetically mutate the T7 gene 5 to specifically inactivate 
the exonuclease domain of the protein. The T7 gene 5 protein purified from such mutants is ideal for use in 

w DNA sequencing without the need to chemically inactivate the exonuclease by oxidation and without the 
secondary damage that inevitably occurs to the protein during chemical modification. 

Genetically modified T7 DNA polymerase can be isolated by randomly mutagenizing the gene 5 and 
then screening for those mutants that have lost exonuclease activity, without loss of polymerase activity. 
Mutagenesis is performed as follows. Single-stranded DNA containing gene 5 (e.g., cloned in pEMBL-8, a 

75 plasmid containing an origin for single stranded DNA replication) under the control of a 17 RNA polymerase 
promoter is prepared by standard procedure, and treated with two different chemical mutagens: hydrazine 
which will mutate C's and T's, and formic acid, which will mutate G's and A*s. Myers et al. 229 Science 242, 
1985. The DNA is mutagenized at a dose which results in an average of one base being altered per plasmid 
molecule. The single-stranded mutagenized plasmids are then primed with a universal 17-mer primer (see 

20 above), and used as templates to synthesize the opposite strands. The synthesized strands contain 
randomly incorporated bases at positions corresponding to the mutated bases in the templates. The double- 
stranded mutagenized DNA is then used to transform the strain K38/pGP1-2, which is strain K38 containing 
the plasmid pGP1-2 (Tabor et al., supra ). Upon heat induction this strain expresses T7 RNA polymerase. 
The transformed cells are plated at 30 • C, with approximately 200 colonies per plate. 

25 Screening for cells having T7 DNA polymerase lacking exonuclease activity is based upon the following 
finding. The 3' to 5' exonuclease of DNA polymerases serves a proofreading function. When bases are 
misincorporated, the exonuclease will remove the newly incorporated base which is recognized as 
"abnormal". This is the case for the analog of dATP, etheno-dATP, which is readily incorporated by T7 
DNA polymerase in place of dATP. However, in the presence of the 3' to 5' exonuclease of 17 DNA 

30 polymerase, it is excised as rapidly as it is incorporated, resulting in no net DNA synthesis. As shown in 
figure 6, using the alternating copolymer poly d(AT) as a template, native 17 DNA polymerase catalyzes 
extensive DNA synthesis only in the presence of dATP, and not etheno-dATP. In contrast, modified 17 DNA 
polymerase, because of its lack of an associated exonuclease, stably incorporates etheno-dATP into DNA at 
a rate comparable to dATP. Thus, using poly d(AT) as a template, and dTTP and etheno-dATP as 

35 precursors, native 17 DNA polymerase is unable to synthesize DNA from this template, while 17 DNA 
polymerase which has lost its exonuclease activity will be able to use this template to synthesize DNA. 

The procedure for lysing and screening large number of colonies is described in Raetz (72 Proc. Nat. 
Acad. Sci. 2274, 1975). Briefly, the K38/pGP1-2 cells transformed with the mutagenized gene 5-containing 
plasmids are transferred from the petri dish, where they are present at approximately 200 colonies per 

40 plate, to a piece of filter paper ("replica plating"). The filter paper discs are then placed at 42 *C for 60 min 
to induce the T7 RNA polymerase, which in turn expresses the gene 5 protein. Thioredoxin is constitutively 
produced from the chromosomal gene. Lysozyme is added to the filter paper to lyse the cells. After a 
freeze thaw step to ensure cell lysis, the filter paper discs are incubated with poly d(AT), [a 32 P]dTTP and. 
etheno-dATP at 37 *C for 60 min. The filter paper discs are then washed with acid to remove the 

45 unincorporated pPJdATP. DNA will precipitate on the filter paper in acid, while nucleotides will be soluble. 
The washed filter paper is then used to expose X-ray film. Colonies which have induced an active T7 DNA 
polymerase which is deficient in its exonuclease will have incorporated acid-insoluble ^P, and will be visible 
by autoradiography. Colonies expressing native T7 DNA polymerase, or expressing a T7 DNA polymerase 
defective in polymerase activity, will not appear on the autoradiograph. 

so Colonies which appear positive are recovered from the master petri dish containing the original 
colonies. Cells containing each potential positive clone will be induced on a larger scale (one liter) and T7 
DNA polymerase purified from each preparation to ascertain the levels of exonuclease associated with each 
mutant. Those low in exonuclease are appropriate for DNA sequencing. 

Directed mutagenesis may also be used to isolate genetic mutants in the exonuclease domain of the T7 

55 gene 5 protein. The following is an example of this procedure. 

T7 DNA polymerase with reduced exonuclease activity (modified T7 DNA polymerase) can also be 
distinguished from native T7 DNA polymerase by its ability to synthesize through regions of secondary 
structure. Thus, with modified DNA polymerase, DNA synthesis from a labeled primer on a template having 
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secondary structure will result in significantly longer extensions, compared to unmodified or native DNA 
polymerase. This assay provides a basis for screening for the conversion of small percentages of DNA 
polymerase molecules to a modified form. 

The above assay was used to screen for altered T7 DNA polymerase after treatment with a number of 

5 chemical reagents. Three reactions resulted in conversion of the enzyme to a modified form. The first is 
treatment with iron and a reducing agent, as described above. The other two involve treatment of the 
enzyme with photooxidizing dyes, Rose Bengal and methylene blue, in the presence of light. The dyes 
must be titrated carefully, and even under optimum conditions the specificity of inactivation of exonuclease 
activity over polymerase activity is low, compared to the high specificity of the iron-induced oxidation. Since 

10 these dyes are quite specific for modification of histidine residues, this result strongly implicates histidine 
residues as an essential species in the exonuclease active site. 

There are 23 histidine residues in T7 gene 5 protein. Eight of these residues lie in the amino half of the 
protein, in the region where, based on the homology with the large fragment of E. coli DNA polymerase I, 
the exonuclease domain may be located (Ollis et al. Nature 313, 818. 1984). As described below, seven of 

75 the eight histidine residues were mutated individually by synthesis of appropriate oligonucleotides, which 
were then incorporated into gene 5. These correspond to mutants 1, and 6-10 in table 1. 

The mutations were constructed by first cloning the T7 gene 5 from pGP5-3 (Tabor et al. t J. Biol. 
Chem. 282, 1987) into the Sma l and Hindlll sites of the vector M13 mp18, to give mGP5-2. (The vector 
used and the source of gene 5 are not critical in this procedure.) Single-stranded mGP5-2 DNA was 

20 prepared from a strain that incorporates deoxy uracil in place of deoxy thymidine (Kunkel, Proc. Natl. Acad. 
ScL USA 82, 488, 1985). This procedure provides a strong selection for survival of only the synthesized 
strand (that containing the mutation) when transfected into wild-type E. Coli , since the strand containing 
uracil will be preferentially degraded. 

Mutant oligonucleotides, 15-20 bases in length, were synthesized by standard procedures. Each 

25 oligonucleotide was annealed to the template extended using native T7 DNA polymerase and ligated using 
T4 DNA ligase. Covalently closed circular molecules were isolated by agarose gel electrophoresis run in the 
presence of 0.5ng/ml ethidium bromide. The resulting purified molecules were then used to transform E. 
coli 71.18. DNA from the resulting plaques was isolated and the relevant region sequenced to confirm each 
mutation. 

30 The following summarizes the oligonucleotides used to generate genetic mutants in the gene 5 
exonuclease. The mutations created are underlined. Amino acid and base pair numbers are taken from 
Dunn et al.. 166 J. Molec. Biol. 477, 1983. The relevant wild type sequences of the region of gene 5 
mutated are also shown. 

35 
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Wild type sequence: 



1 9.? < aa > 122 123 

5 leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly Ser His Ala Leu Giu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 
14677 (T7 bp) 



10 



15 



20 



25 



30 



Mutation 1: His 123 -* Ser 123 

Priaer used: 5' CGC TTT GGA TCC GCT TTG 3* 

Mutant sequence: 

123 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Argr Phe Gly Ser S&Z Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GG& TCC ZCC GCT TTG GAG 

Mutation 2: Deletion of Ser 122 and His 123 

Priner used: 5 ' GGA AAA CGC TTT GGC GCC TTG GAG GCG 3' 

A 

6 base deletion 

Mutant sequence: 

122 123 

*eu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly Aia Leo Glu 

CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGC GCC TTG GAG 
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Mutation 3: Ser 122, His 123 -> Ala 122, Glu 123 
Primer used: 5' CGC TTT GGG GCT GAG GCT TTG G 3' 
Mutant sequence: 

122 123 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly &lA Glu Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG QCT fiAfi GCT TTG GAG 



Mutation 4: Lys 118, Arg 119 Glu 118, Glu 119 

Primer used: 5* 5 f G CCC GGG GAA GAG TTT GGG TCT CAC GC 3» 

Mutant sequence: 

118 119 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Glu Glu Phe Gly Ser His Aia Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGG £AA GAG TTT GGG TCT CAC GCT TTG GAG 

Mutation 5:. Arg 111, Ser 112, Lys 114 -> Glu 111, Ala 112, Glu 114 

Primer used : 5' G GGT CTT CTG GAA GCC GGC GAG TTG CCC GG 3 1 

Mutant sequence: 

111 112 114 

Leu Leu Glu Ala Gly Glu Leu Pro Gly Lys Arg Phe Gly Ser His Ala Leu 
Glu 

CTT CTG GAA GCC GGC GAG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 



Mutation 6: His 59, His 62 -» Ser 59, Ser 62 
35 Primer used: 5' ATT GTG TTC 2CC AAC GGA ICC AAG TAT GAC G 3t 
wild-type sequence: 

aa: 55 59 62 

Leu lie Val Phe His Asn Gly His Lys Tyr Asp Val 
40 CTT ATT GTG TTC CAC AAC GGT CAC AAG TAT GAC GTT 

T7 bp: 14515 

Mutant sequence: 

59 62 

45 Leu lie val Phe Asn Gly Ser Lys Tyr Asp Val 

CTT ATT GTG TTC 2£C AAC GGA ICC AAG TAT GAC GTT 



50 



55 
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Mutation 7: His 82 -> Ser 82 

Primer used: 5' GAG TTC 2£P CTT CCT CG 3' 

Wild-type sequence: 

aa: 77 82 

Leu Asn Arg Glu Phe His Leu Pro Arg Glu Asn 
TTG AAC CGA GAG TTC CAC CTT CCT CGT GAG AAC 
T7 bp: 14581 

Mutant sequence: 

82 

Leu Asn Arg Glu Phe Ser Leu Pro Arg Glu Asn 
TTG AAC CGA GAG TTC ICC CTT CCT CGT GAG AAC 



75 



20 



40 



Mutation 8: Arg 96, His 99 -> Leu 96, Ser 99 
Primer used: 5 f CIS TTG ATT ICT TCC AAC CTC 3' 
Wild-type sequence: 



aa: 93 96 99 

Val Leu Ser Arg Leu He His Ser Asn Leu Lys Asp Thr Asp 
25 GTG TTG TCA CGT TTG ATT CAT TCC AAC CTC AAG GAC ACC GAT 

T7 bp.: 14629 

Mutant sequence: 

96 99 

Val Leu Ser Zsu Leu lie Ser Ser Asn Leu Lys Asp Thr Asp 
30 GTG TTG TCA Clfi TTG ATT X£T TCC AAC CTC AAG GAC ACC GAT 



Mutation 9: His 190 -> Ser 190 

35 Primer used: 5 1 CT GAC AAA i£T TAC TTC CCT 3' 

Wild-type sequence: 

aa: 185 190 

Leu Leu Ser Asp Lys His Tyr Phe Pro Pro Glu 

CTA CTC TCT GAC AAA CAT TAC TTC CCT CCT GAG 
T7 bp: 14905 



Mutant sequence: 

190 

45 Leu Leu Ser Asp Lys £er Tyr Phe Pro Pro Glu 

CTA CTC TCT GAC AAA X£T TAC TTC CCT CCT GAG 



50 
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Mutation 10: His 218 -» Ser 218 

Priaer u«d: 5' GAC ATT GAA I£T CGT GCT GC 3« 

Wild-type sequence: 

... 214 218 

* Val Asp Tie Giu His Arg Ala Ala Trp Leu Leu 
GTT GAC ATT GAA CAT CGT GCT GCA TGG CTG CTC 
T7 bp: 14992 



Mutant sequence: 



218 



val Asp lie Glu l££ Arg Ala Ala Trp Leu Leu 
GTT GAC ATT GAA ££T CGT GCT GCA TGG CTG CTC 



75 



20 



Mutation 11*: Deletion of amino acids 118 to 123 
Primer used: 5* C GGC AAG TTG CCC GGfi GCT TTG GAG GCG TGG G 3* 

18 base deletion 



25 



30 



Wild-type sequence: 

109 (aa) 118 X 22 123 126 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly Ser His Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 
14677 (T7 bp) 



35 



40 



Mutant sequence: 

117 124 

Leu Leu Arg Ser Gly Lys Leu Pro Gly (6 amino acids) Ala Leu Glu 

CTT CTG CGT TCC GGC AAG TTG CCC GGG (18 bases) GCT TTG GAG 



Mutation 12: His 123 -* Glu 123 
Primer used: 5 1 GGG TCT GAG GCT TTG G 3' 
Mutant sequence: v 

45 123 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly Ser Qlu Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG TCT GAG GCT TTG GAG 



50 



55 
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Kat»tion 13 : (Arg 131. Ly$ 136, Lys 140, Lys 144, Arg 145 

Giu 131, Glu 136, GIu 140, Glu 144, Glu 145) 



Priaar used: 5* GGT TAT fiifi CC GGC GAG ATG £AG GGT GAA TAC fiAA GAC GAC TTT GAG CSA ATG 
CTT GAA G 3* 



Wild-type sequence; 

129 (aa> 131 136 140 144 145 

Gly Tyr Arg Leu Gly Glu Met Lys Gly Glu Tyr Lys Asp Asp Phe Lys Arg Wee Leu Gla Glu 
GGT TAT CGC TTA GGC GAG ATG AAG GGT GAA TAC AAA GAC GAC TTT AAG CGT ATG CTT GAA G 
14737 {T7 bp) 



75 



20 



Mutant sequence: 

129 (aa) 131 136 140 144 145 

Gly Tyr Glu Leu Gly Glu Met Glu Gly Glu Tyr GJji Asp Asp Phe Glu Glu Met Leu Giu Glu 

GGT TAT £ifi CC GGC GAG ATG fiAG GGT GAA TAC fiAA GAC GAC TTT GAG GAA ATG CTT GAA G 
14737 (T7 bp) 



25 



30 



Each mutant gene 5 protein was produced by infection of the mutant phage into K38/pGP1-2, as 
follows. The cells were grown at 30 * C to an A590 = 1 -0. The temperature was shifted to 42 • C for 30 min. ( to 
induce T7 RNA polymerase. IPTG was added to 0.5 mM, and a lysate of each phage was added at a 
moi = 10. Infected cells were grown at 37*C for 90 min. The cells were then harvested and extracts 
prepared by standard procedures for 17 gene 5 protein. 

Extracts were partially purified by passage over a phosphocellulose and DEAE A-50 column, and 
assayed by measuring the polymerase and exonuclease activities directly, as described above. The results 
are shown in Table 1 . 



35 



40 



Table 1 

SUMMARY OF EXONUCLEASE AND POLYMERASE 
ACTIVITIES OF T7 GENE 5 MUTANTS 



Mutant 



Exonuclease 
activity, % 



Polymerase 
activity, \ 



50 



[Wild-type] 
Mutant 1 

(His 123 -* Sex 123) 
Mutant 2 

(A Sex 122, His 123) 
Mutant 3 

(Scr 122, His 123 



► Ala 122, Glu 123) 



[100]* 

10-25 
0.2-0.4 

<2 



[100]b 
>90 
>90 
>90 
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Table 1 

SUMMARY OF EXONUCLEASE AND POLYMERASE 
ACTIVITIES OF T7 GENE 5 MUTANTS 

Exonuclease Polymerase 
Mutant activity, % activity, % 

Mutant 4 

(Lysll8,Argll9->Glull8,Glull9) <30 >90 

MurantS 

(Arg 111, Sex 112, Lys 114-> 

Glu 111, Ala 112, Glu 114) >75 >90 

Mutant 6 

(His 59, His 62 Ser 59 t Set 62) >75 >90 

Mutant7 

(His 82 Scr 82) >75 >90 

Mutant 8 

(Arg96,HIs99-»Leu96,Ser99) >75 >90 

Mutant 9 

(His 190 -» Ser 190) >75 >90 

Mutant 10 

(His218->Ser218) >75 >90 

Mutant 11 
(A Lys 1 18, Arg 119, Phe 120, 
Gly 121, Scr 122, His 123) <0.02 >90 

Mutant 12 

(His 123 Glu 123) <30 >90 

Mutant 13 

(Arg 131, Lys 136, Lys 140, Lys 144, Arg 145 

Glu 131, Glu 136, Glu 140, Glu 144, Glu 145) <30 >90 

ZX ?noa lease a « ivi -V v ? s .measured on single stranded [3 H ] T 7 
100% exonuclease activity corresponds to 5,000 unirs/^g 

dna ? °te r ^?, a " ivity r«- measured using single-stranded calf thymus 
1001 -.Polymerase activity corresponds to 8,000 units/ag. r -^ s 



Of the seven histidines tested, only one (His 123: mutant 1) has the enzymatic activities characteristic 
of modified T7 DNA polymerase. T7 gene 5 protein was purified from this mutant using DEAE-cellulose, 
phosphocellulose, DEAE-Sephadex and hydroxylapatite chromatography. While the polymerase activity was 
nearly normal (>90% the level of the native enzyme), the exonuclease activity was reduced 4 to 10-fold. 

A variant of this mutant was constructed in which both His 123 and Ser 122 were deleted. The gene 5 
protein purified from this mutant has a 200-500 fold lower exonuclease activity, again with retention of 
>90% of the polymerase activity. 

These data strongly suggest that His 123 lies in the active site of the exonuclease domain of T7 gene 5 
protein. Furthermore, it is likely that the His 123 is in fact the residue being modified by the oxidation 
involving iron, oxygen and a reducing agent, since such oxidation has been shown to modify histidine 
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residues in other proteins (Levine, J. Biol. Chem. 258: 11823, 1983; and Hodgson et al. Biochemistry 14: 
5294, 1975). The level of residual exonuclease in mutant 11 is comparable to the levels obtainable 
chemical modification. 

Although mutations at His residues are described, mutations at nearby sites or even at distant sites may 
5 also produce mutant enzymes suitable in this invention, e.g., lys and arg (mutants 4 and 15). Similarly, 
although mutations in some His residues have little effect on exonuclease activity that does not necessarily 
indicate that mutations near these residues will not affect exonuclease activity. 

Mutations which are especially effective include those having deletions of 2 or more amino acids, preferably 
6-8, for example, near the His-123 region. Other mutations should reduce exonuclease activity further, or 
10 completely. 

As an example of the use of these mutant strains the following is illustrative. A pGP5-6 (mutation 11)- 
containing strain has been deposited with the ATCC (see below). The strain is grown as described above 
and induced as described in Taber et al. J. Biol. Chem. 262:16212 (1987). K38/pTrx-2 cells may be added 
to increase the yield of genetically modified T7 DNA polymerase. 
75 The above noted deposited strain also contains plasmid pGP1-2 which expresses T7 RNA polymerase. 
This plasmid is described in Tabor et al., Proc. Nat. Acad. Sci. USA 82:1074, 1985 and was deposited with 
the ATCC on March 22, 1985 and assigned the number 40,175. 
Referring to Fig. 10, pGP5-6 includes the following segments: 
1. Eco RI-Sacl -Sma l- Bam HI poiylinker sequence from M13 mp10 (21 bp). 
20 2. T7 bp 14309 to 16747, that contains the T7 gene 5, with the following modifications: 
T7 bp 14703 is changed from an A to a G, creating a Smal site. 
T7 bp 14304 to 14321 inclusive are deleted (18 bp). 

3. Sall-Pstl-Hindlll poiylinker sequence from M13 mp 10 (15 bp) 

4. pBR322 bp 29 (Hindlll site) to pBR322 bp 375 (Bam HI site). 

25 5. T7 bp 22855 to T7 bp 22927, that contains the T7 RNA Polymerase promoter <J>10 with Bam HI linkers 
inserted at each end (82 bp). 

6. pBR322 bp 375 (BamHI site) to pBR322 bp 4361 (EcoRI site). 
DNA Sequencing Using Modified T7-type DNA Polymerase 

30 

DNA synthesis reactions using modified T7-type DNA polymerase result in chain-terminated fragments 
of uniform radioactive intensity, throughout the range of several bases to thousands of bases in length. 
There is virtually no background due to terminations at sites independent of chain terminating agent 
incorporation (i.e. at pause sites or secondary structure impediments). 

35 Sequencing reactions using modified T7-type DNA polymerase consist of a pulse and chase. By pulse 
is meant that a short labelled DNA fragment is synthesized; by chase is meant that the short fragment is 
lengthened until a chain terminating agent is incorporated. The rationale for each step differs from 
conventional DNA sequencing reactions. In the pulse, the reaction is incubated at 0*C-37*C for 0.5-4 min 
in the presence of high levels of three nucleotide triphosphates (e.g., dGTP, dCTP and dTTP) and limiting 

40 levels of one other labelled, carrier-free, nucleotide triphosphate, e.g., l^S) dATP. Under these conditions 
the modified polymerase is unable to exhibit its processive character, and a population of radioactive 
fragments will be synthesized ranging in size from a few bases to several hundred bases. The purpose of 
the pulse is to radiqactively label each primer, incorporating maximal radioactivity while using minimal 
levels of radioactive nucleotides. In this example, two conditions in the pulse reaction (low temperature, e.g., 

45 from 0-20 *C, and limiting levels of dATP, e.g., from 0.1uM to 1uM) prevent the modified T7-type DNA 
polymerase from exhibiting its processive character. Other essential environmental components of the 
mixture will have similar effects, e.g. limiting more than one nucleotide triphosphate or increasing the ionic 
strength of the reaction. If the primer is already labelled (e.g., by kinasing) no pulse step is required. 

In the chase, the reaction is incubated at 45 *C for 1-30 min in the presence of high levels (50-500uM) 

so of all four deoxynucleoside triphosphates and limiting levels (1-50uM) of any one of the four chain 
terminating agents, e.g., dideoxynucleoside triphosphates, such that DNA synthesis is terminated after an 
average of 50-600 bases. The purpose of the chase is to extend each radioactively labeled primer under 
conditions of processive DNA synthesis, terminating each extension exclusively at correct sites in four 
separate reactions using each of the four dideoxynucleoside triphosphates. Two conditions of the chase 

55 (high temperature, e.g., from 30-50 'Q and high levels (above 50uM) of all four deoxynucleoside 
triphosphates) allow the modified T7-type DNA polymerase to exhibit its processive character for tens of 
thousands of bases; thus the same polymerase molecule will synthesize from the primer-template until a 
dideoxynucleotide is incorporated. At a chase temperature of 45 *C synthesis occurs at >700 
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nucleotides/sec. Thus, for sequencing reactions the chase is complete in less than a second, ssb increases 
processivity, for example, when using dITP, or when using low temperatures or high ionic strength, or low 
levels of triphosphates throughout the sequencing reaction. 

Either [a^SJdATP, [a 32 P]dATP or fluorescently labelled nucleotides can be used in the DNA sequen- 

s cing reactions with modified T7-type DNA polymerase. If the fluorescent analog is at the 5' end of the 
primer, then no pulse step is required. 

Two components determine the average extensions of the synthesis reactions. First is the length of 
time of the pulse reaction. Since the pulse is done in the absence of chain terminating agents, the longer 
the pulse reaction time, the longer the primer extensions. At 0 # C the polymerase extensions average 10 

io nucleotides/sec. Second is the ratio of deoxyribonucleoside triphosphates to chain terminating agents in the 
chase reaction. A modified T7-type DNA polymerase does not discriminate against the incorporation of 
these analogs, thus the average length of extension in the chase is four times the ratio of the deox- 
ynucleoside triphosphate concentration to the chain terminating agent concentration in the chase reaction. 
Thus, in order to shorten the average size of the extensions, the pulse time is shortened, e.g., to 30 sec. 

75 and/or the ratio of chain terminating agent to deoxy nucleoside triphosphate concentration is raised in the 
chase reaction. This can be done either by raising the concentration of the chain terminating agent or 
lowering the concentration of deoxynucleoside triphosphate. To increase the average length of the 
extensions, the pulse time is increased, e.g., to 3-4 min; and/or the concentration of chain terminating agent 
is lowered (e.g., from 20uM to 2uM) in the chase reaction. 

20 

Example 2 : DNA sequencing using modified T7 DNA polymerase 

The following is an example of a sequencing protocol using dideoxy nucleotides as terminating agents. 
9ul of single-stranded M13 DNA (mGP1-2, prepared by standard procedures) at 0.7 mM concentration 

25 is mixed with 1 ul of complementary sequencing primer (standard universal 17-mer, 0.5 pmole primer / ul) 
and 2.5 ul 5X annealing buffer (200 mM Tris-HCI, pH 7.5, 50 mM MgCI 2 ) heated to 65 °C for 3 min, and 
slow cooled to room temperature over 30 min. In the pulse reaction, 12.5 ul of the above annealed mix was 
mixed with 1 ul dithiothreitol 0.1 M, 2 ul of 3 dNTPs (dGTP, dCTP, dTTP) 3 mM each (P.L Biochemicals in 
TE), 2.5 ul [a^SJdATP, (1500 Ci/mmol, New England Nuclear) and 1 ul of modified T7 DNA polymerase 

30 described in Example 1 (0.4 mg/ml, 2500 units/ml, i.e. 0.4 ug, 2.5 units) and incubated at 0*C, for 2 min, 
after vortexing and centrifuging in a microfuge for 1 sec. The time of incubation can vary from 30 sec to 20 
min and temperature can vary from 0*C to 37 *C. Longer times are used for determining sequences distant 
from the primer. 

4.5 ul aliquots of the above pulse reaction are added to each of four tubes containing the chase mixes, 
35 preheated to 45 *C. The four tubes, labeled G, A, T, C, each contain trace amounts of either dideoxy (dd) 
G, A, T, or C (P-L Biochemicals). The specific chase solutions are given below. Each tube contains 1.5 ul 
dATP 1mM, 0.5 ul 5X annealing buffer (200 mM Tris-HCI, pH 7.5, 50mM MgCfe), and 1.0 ul ddNTP 100 
uM (where ddNTP corresponds to ddG.AJ or C in the respective tubes). Each chase reaction is incubated 
at 45 °C (or 30°C-50*C) for 10 min, and then 6 ul of stop solution (90% formamide, 10mM EDTA, 0.1% 
40 xylenecyanol) is added to each tube, and the tube placed on ice. The chase times can vary from 1-30 min. 

The sequencing reactions are run on standard, 6% polyacrylamide sequencing gel in 7M urea, at 30 
Watts for 6 hours. Prior to running on a gel the reactions are heated to 75 *C for 2 min. The gel is fixed in 
10% acetic acid, 10% methanol, dried on a. gel dryer, and exposed to Kodak OM1 high-contrast 
autoradiography film overnight. 

45 

Example 3: DNA sequencing using limiting concentrations of dNTPS 

In this example DNA sequence analysis of mGP1-2 DNA is performed using limiting levels of all four 
deoxyribonucleoside triphosphates in the pulse reaction. This method has a number of advantages over the 

so protocol in example 2. First, the pulse reaction runs to completion, whereas in the previous protocol it was 
necessary to interrupt a time course. As a consequence the reactions are easier to run. Second, with this 
method it is easier to control the extent of the elongations in the pulse, and so the efficiency of labeling of 
sequences near the primer (the first 50 bases) is increased approximately 10-fold. 

7 ul of 0.75 mM single-stranded M13 DNA (mGP1-2) was mixed with 1ul of complementary sequencing 

55 primer (17-mer, 0.5 pmole primer/ul) and 2 ul 5X annealing buffer (200 mM Tris-HCI pH 7.5, 50 mM MgCI 2 , 
250 mM NaCI) heated at 65 "C for 2 min, and slowly cooled to room temperature over 30 min. In the pulse 
reaction 10 ul of the above annealed mix was mixed with 1 ul dithiothreitol 0.1 M, 2 ul of 3 dNTPs (dGTP, 
dCTP, dTTP) 1.5 uM each, 0.5ul [[a^SJdATP, ( a 10uM) (about 10uM, 1500 Ci/mmol, New England 



19 



EP 0 386 859 B1 



Nuclear) and 2 ul modified T7 DNA polymerase (0.1 mg/ml, 1000 units/ml, i.e., 0.2 ug, 2 units) and 
incubated at 37 *C for 5 min. (The temperature and time of incubation can be varied from 20*C-45*C and 
1-60 min., respectively.) 

3.5 nl aliquots of the above pulse reaction were added to each of four tubes containing the chase 
5 mixes, which were preheated to 37 *C. The four tubes, labeled G, A, T, C, each contain trace amounts of 
either dideoxy G, A, T, C. The specific chase solutions are given below. Each tube contains 0.5 ul 5X 
annealing buffer (200 mM Tris-HCI pH 7.5, 50 mM MgCI 2 , 250 mM NaCI), 1 ul 4dNTPs (dGTP, dATP, 
dTTP, dCTP) 200 uM each, and 1.0 ul ddNTP 20 uM. Each chase reaction is incubated at 37 # C for 5 min 
(or 20*C-45*C and 1-60 min respectively), and then 4 ul of a stop solution (95% formamide, 20 mM 
10 EDTA, 0.05% xylene-cyanol) added to each tube, and the tube placed on ice prior to running on a standard 
polyacrylamide sequencing gel as described above. 

Example 4: Replacement of dGTP with dITP for DNA sequencing 

75 In order to sequence through regions of compression in DNA, i.e., regions having compact secondary 
structure, it is common to use dITP (Mills et al., 76 Proc. Natl. Acad. Sci. 2232, 1979) or deazaguanosine 
triphosphate (deaza GTP, Mizusawa et al., 14 Nuc. Acid Res. 1319, 1986). We have found that both analogs 
function well with T7-type polymerases, especially with dITP in the presence of ssb. Preferably these 
reactions are performed with the above described genetically modified T7 polymerase, or the chase 

20 reaction is for 1-2 min., and/or at 20 * C to reduce exonuclease degradation. 

Modified T7 DNA polymerase efficiently utilizes dITP or deoza-GTP in place of dGTP. dITP is 
substituted for dGTP in both the pulse and chase mixes at a concentration two to five times that at which 
dGTP is used. In the ddG chase mix ddGTP is still used (not ddlTP). 

The chase reactions using dITP are sensitive to the residual low levels (about 0.01 - units) of ex- 

25 onuclease activity. To avoid this problem, the chase reaction times should not exceed 5 min when dITP is 
used. It is recommended that the four dITP reactions be run in conjunction with, rather than to the exclusion 
of, the four reactions using dGTP. If both dGTP and dITP are routinely used, the number of required mixes 
can be minimized by: (1) Leaving dGTP and dITP out of the chase mixes, which means that the four chase 
mixes can be used for both dGTP and dITP chase reactions. (2) Adding a high concentration of dGTP or 

30 dITP (2ul at 0.5 mM and 1-2.5 mM respectively) to the appropriate pulse mix. The two pulse mixes then 
each contain a low concentration of dCTP.dTTP and [a^SJdATP, and a high concentration of either dGTP 
or dITP. This modification does not usually adversely effect the quality of the sequencing reactions, and 
reduces the required number of pulse and chase mixes to run reactions using both dGTP and dITP to six. 
The sequencing reaction is as for example 3, except that two of the pulse mixes contain a) 3 dNTP mix 

35 for dGTP: 1.5 uM dCTP.dTTP, and 1 mM dGTP and b) 3 dNTP mix for dITP: 1.5 uM dCTP.dTTP, and 2 
mM dITP. In the chase reaction dGTP is removed from the chase mixes (i.e. the chase mixes contain 30 
UM dATP,dTTP and dCTP, and one of the four dideoxynucleotides at 8 uM), and the chase time using 
dITP does not exceed 5 min. 

40 Deposits 

Strains K38/pGP5-5/pTrx-2, K38/pTrx-2 and M13 mGP1-2 have been deposited with the ATCC and 
assigned numbers 67,287, 67,286, and 40,303 respectively. These deposits were made on January 13, 
1987. Strain K38/pGP1-2/pGP5-6 was deposited with the ATCC. On December 4, 1987, and assigned the 
45 number 67571. 

Applicants' and their assignees acknowledge their responsibility to replace these cultures should they 
die before the end of the term of a patent issued hereon, 5 years after the last reguest for a culture, or 30 
years, whichever is the longer, and its responsibility to notify the depository of the issuance of such a 
patent, at which time the deposits will be made irrevocably available to the public. Until that time the 
so deposits will be made irrevocably available to the Commissioner of Patents under the terms of 37 CFR 
Section 1-14 and 35 USC Section 112. 

Claims 

55 1. A method of producing a purified modified DNA polymerase which method comprises expressing a 
modified gene which gene encodes a modified processive DNA polymerase which has sufficient DNA 
polymerase activity for use in DNA sequencing when said polymerase is combined with any cofactor 
necessary for said DNA polymerase activity and which results from the modification of a naturally 

20 



EP 0 386 859 B1 



occuring gene modified in that one or more amino acids in the 3* - 5' exonuciease domain of said 
naturally ocurring DNA polymerase are replaced by an amino acid other than that naturally occuring at 
the site of substitution or are deleted so as to reduce the activity of naturally occurring 3* - 5' 
exonuciease activity of the naturally occurring DNA polymerase. 

5 

2. A method according to claim 1 characterised in that the polymerase activity of the modified DNA 
polymerase is at least 90% of that of the naturally occurring DNA polymerase. 

3. A method according to claim 1 or claim 2 further characterised in that said polymerase is a modified 
70 bacteriophage T7-type DNA polymerase which has a 3' - 5' exonuciease activity at least 50% lower 

than the naturally-occurring exonuciease activity of naturally occurring T7-type DNA polymerase. 

4. A method according to any of claims 1 to 3 characterised in that said gene encodes a processive 
modified DNA polymerase modified to reduce the activity of the naturally occurring 3* - 5* exonuciease 

75 activity to less than 500 units per milligram of polymerase. 

5. A method according to any one of claims 1 to 4 further characterised in that said gene has been 
modified to eliminate the naturally occurring exonuciease activity of the naturally occurring DNA 
polymerase. 

20 

6. A method according to claim 4 wherein a naturally occurring His residue of the naturally occurring DNA 
polymerase is replaced or deleted. 

7. A method according to any of claims 4 to 6 characterised in that said processive modified T7-type DNA 
25 polymerase is T7 DNA polymerase. 

8. A method according to claim 7 characterised in that His 123 of the naturally occurring 17 DNA 
polymerase is replaced or deleted. 

30 9. A method according to claim 7 characterised in that Ser 122 and His 123 are replaced or deleted. 

10. A method according to claim 7 characterised in that amino acid residues Lys 118 to His 123 are 
deleted. 

35 11. A method according to claim 7 or claim 9 characterised in that Lys 118 and Arg 119 of the naturally 
occurring T7 DNA polymerase are replaced or deleted. 

12. A method accord claim 7 characterised in that Arg 131, Lys 136, Lys 140, Lys 144 and Arg 145 of 
naturally occurring T7 DNA polymerase are replaced or deleted. 

40 

13. A method according to any of claims 1 to 12 wherein the modified processive DNA polymerase is able 
to remain bound to DNA for at least 500 bases under conditions normally used for DNA sequencing 
reactions. 

45 14. A purified modified gene characterised in that it encodes a processive modified T7-type DNA 
polymerase which polymerase is able to remain bound to DNA for at least 500 bases under conditions 
normally used for DNA sequencing reactions and which has sufficient DNA polymerase activity for use 
in DNA sequencing when said polymerase is combined with any host factor necessary for said DNA 
polymerase activity and which results from the modification of a naturally occurring gene modified to 

50 reduce the activity of naturally occurring 3'- 5* exonuciease activity of the naturally occurring DNA 
polymerase wherein one or more amino acids of the exonuciease domain within the amino terminal half 
of the T7 DNA polymerase of said naturally occurring DNA polymerase, or the corresponding domain of 
other T7-type DNA polymerases, are replaced by an amino acid other than that naturally occurring at 
the site of substitution or are deleted. 

55 

15. A purified modified gene according to claim 14 characterised in that one or more of the amino acids of 
the exonuciease domain from the amino terminal to amino acid residue 224 of T7 DNA polymerase of 
said naturally occurring DNA polymerase, or the corresponding domain of other T7-type DNA poly- 
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merases, are replaced by an amino acid other than that naturally occurring at the site of substitution or 
are deleted. 

16. A purified modified gene according to claims 14 or 15 characterised in that the polymerase activity of 
5 the processive T7-type modified polymerase is at least 90% of that of the naturally occurring T7-type 

DNA polymerase. 

17. A purified modified gene according to any of claims 14 to 15 further characterised in that said 
polymerase has an exonuclease activity at least 50% lower than the naturally-occurring exonuclease 

io activity of naturally occurring T7-type DNA polymerase. 

18. A purified modified gene according to any of claims 14 to 17 characterised in that said gene encodes a 
processive modified DNA polymerase modified to reduce the activity of the naturally occurring 
exonuclease activity to less than 500 units per milligram of polymerase. 

75 

19. A purified modified gene as claimed in any of claims 14 to 18 further characterised in that said gene 
has been modified to eliminate the naturally occurring exonuclease activity of the naturally occurring 
DNA polymerase. 

20 20. A purified modified gene according to any of claims 14 to 19 further characterised in that a His residue 
of the 3'- 5 f exonuclease domain of the naturally occurring DNA polymerase is replaced or deleted. 

21. A purified modified gene according to any of claims 14 to 20 characterised in that said processive 
modified T7-type DNA polymerase is 17 DNA polymerase. 

25 

22. A purified modified gene as claimed in claim 21 characterised in that His 123 of the naturally occurring 
T7 DNA polymerase is replaced or deleted. 

23. A purified modified gene according to claim 21 characterised in that Ser 122 and His 123 are replaced 
30 or deleted. 

24. A purified modified gene according to claim 21 characterized in that amino acid residues Lys 118 to 
His 123 are deleted. 

35 25. A purified modified gene according to claim 21 characterized in that Lys 118 and Arg 119 of the 
naturally occurring T7 DNA polymerase are replaced or deleted. 

26. A purified modified gene according to claim 21 characterized in that Arg 131, Lys 136, Lys 140, Lys 
144 and Arg 145 of the naturally occurring T7 DNA polymerase are replaced or deleted. 

40 

27. The use of a modified processive DNA polymerase produced according to the method of claim 1 for 
DNA sequencing. 

PatentansprUche 

45 

1. Verfahren zum Herstellen einer gereinigten, modifizierten DNA-Polymerase, bei dem ein modifiziertes 
Gen expremiert wird, das filr eine modifizierte prozessive DNA-Polymerase codiert, die eine ausrei- 
chende DNA-Polymeraseaktivitat zur Verwendung beim DNA-Sequenzieren aufweist, wenn die Polyme- 
rase mit einem Kofaktor kombiniert wird, der fiir die DNA-Polymeraseaktivitat notwendig ist, und das 

50 durch die Modifikation eines natOrlich auftretenden Gens erhalten wird, das dahingehend modifiziert ist, 
daB eine oder mehrere Aminosauren in der 3'- 5'-ExonukleasedomMne der naturlich auftretenden DNA- 
Polymerase entfernt oder durch eine Aminosaure ersetzt ist bzw.sind, die sich von der an der 
Substitutionsstelle natOrlich auftretenden Aminosaure unterscheidet, so daB die natUrliche Aktivitat der 
3'-5'-Exonukleaseaktivitat der naturlich auftretenden Polymerase vermindert ist. 

55 

2. Verfahren nach Anspruch 1, dadurch gekennzeichnet, daB die Polymeraseaktivitat der modifizierten 
DNA-Polymerase wenigstens 90% der der naturlich auftretenden DNA-Polymerase ist. 
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3. Verfahren nach Anspruch 1 Oder 2, dadurch gekennzeichnet, daB die Polymerase eine modifizierte 
DNA-Polymerase von Baktereophagen vom T7-Typ ist, die eine S'-S'-Exonukleaseaktivitat aufweist, die 
wenigstens 50% niedriger ist als die natUrlich auflrende Exonukleaseaktivitat der natUrlich auftretenden 
T7-Typ-DNA-Polymerase. 

5 

4. Verfahren nach einem der Anspruche 1 bis 3, dadurch gekennzeichnet, daB das Gen fur eine 
prozessive modifizierte DNA-Polymerase codiert, die so modifiziert ist, daB die Aktivitat der natUrlich 
auftretenden y-5'~ Exonukleaseaktivitat auf weniger als 500 Einheiten pro Milligramm Polymerase 
vermindert ist. 

70 

5. Verfahren nach einem der AnsprUche 1 bis 4, dadurch gekennzeichnet, daB das Gen modifiziert ist, urn 
die natUrlich auftretende Exonukleaseaktivitat der natUrlich auftretenden DNA-Polymerase zu eliminie- 
ren. 

75 6. Verfahren nach Anspruch 4, bei dem der natUrlich auftretende His-Baustein der natUrlich auftretenden 
DNA-Polymerase ersetzt Oder entfernt ist. 

7. Verfahren nach einem der Anspruche 4 bis 6, dadurch gekennzeichnet, daB die modifizierte T7-Typ- 
DNA-Polymerase eine T7-Polymerase ist. 

20 

8. Verfahren nach Anspruch 7, dadurch gekennzeichnet, daB His 123 der natUrlich auftretenden T7-DNA- 
Polymerase ersetzt Oder entfernt ist. 

9. Verfahren nach Anspruch 7, dadurch gekennzeichnet, daB Ser 122 und His 123 ersetzt Oder entfernt 
25 sind. 

10. Verfahren nach Anspruch 7, dadurch gekennzeichnet, daB die Aminosaurebausteine Lys 118 bis His 
123 entfernt sind. 

30 11. Verfahren nach Anspruch 7 oder 9, dadurch gekennzeichnet, daB Lys 118 und Arg 119 der natUrlich 
auftretenden T7-DN A- Polymerase ersetzt oder entfernt sind. 

12. Verfahren nach Anspruch 7, dadurch gekennzeichnet, daB Arg 131, Lys 136, Lys 140, Lys 144 und Arg 
145 der natUrlich auftretenden T7-DN A- Polymerase ersetzt oder entfernt sind. 

35 

13. Verfahren nach einem der AnsprUche 1 bis 12, bei dem die modifizierte prozessive DNA-Polymerase 
unter Bedingungen, wie sie normalerweise fUr die DNA-Sequenzierungsreaktionen verwendet werden, 
in der Lage ist, wenigstens 500 Basen lang an der DNA gebunden zu bleiben. 

40 14. Gereinigtes modifiziertes Gen, dadurch gekennzeichnet, daB es fur eine prozessive modifizierte T7- 
Typ-DNA-Polymerase codiert, die in der Lage ist, unter Bedingungen, die normalerweise fur die DNA- 
Sequenzierungsreaktionen verwendet werden, wenigstens 500 Basen lang an DNA gebunden zu 
bleiben, und die zur Verwendung bet der DNA-Sequenzierung eine ausreichende DNA-Polymeraseakti- 
vitat aufweist, wenn die Polymerase mit einem Wirtsfaktor zusammengebracht wird, der fUr die 

45 Polymeraseaktivitat notwendig ist, und das sich aus der Modrfikation eines natUrlich auftretenden Gens 
ergibt, das modifiziert ist, urn die Aktivitat der natUrlich auftretenden 3'-5'- Exonukleaseaktivitat der 
natUrlich auftretenden Polymerase zu vermindern, bei der eine oder mehrere AminosSuren der 
Exonukleasedomane innerhalb der aminoterminalen Halfte der T7-DNA-Polymerase der natUrlich auftre- 
tenden DNA-Polymerase oder die entsprechende Domane anderer T7-Typ-DNA-Polymerasen fehft 

50 bzw. fehlen oder durch eine Aminosaure ersetzt ist bzw. sind, die sich von der an der Ersatzstelle 
naturlicherweise vorhandenen Aminosaure unterscheidet. 

15. Gereinigtes modifiziertes Gen nach Anspruch 14, dadurch gekennzeichnet, daB eine oder mehrere der 
Aminosauren der Exonukleasedomane von dem Aminoende zu dem Aminosaurebaustein 224 der T7- 
55 DNA-Polymerase der natUrlich auftretenden DNA-Polymerase oder der entsprechenden Domane ande- 
rer T7-Typ-DNA-Polymerasen entfernt oder durch eine Aminosaure ersetzt ist bzw. sind, die sich von 
der an der Substitutionstelle natUrlich auftretenden Aminosaure unterscheidet. 
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16. Gereinigtes modifiziertes Gen nach Anspruch 14 oder 15, dadurch gekennzeichnet, daB die Polymera- 
seaktivitat der prozessiven modifizierten T7-Typ-Polymerase wenigstens 90% der natUrlich airftreten- 
den T7-Typ-DNA-Polymerase ist. 

s 17. Gereinigtes modifiziertes Gen nach einem der AnsprOche 14 und 15, dadurch gekennzeichnet, daB die 
Polymerase eine Exonukleaseaktivitat aufweist, die wenigstens 50% kieiner ist als die naturlich 
auftretende Exonukleaseaktivitat der natQrlich auftretenden T7-Typ-DNA-Polymerase. 

18. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 17, dadurch gekennzeichnet, daB das 
io Gen ftir eine prozessive modifizierte DN A- Polymerase codiert, die modifiziert ist, urn die Aktivitat der 

natQrlich auftretenden Exonukleaseaktivitat auf weniger als 500 Einheiten pro Milligramm Polymerase 
zu verringern. 

19. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 18, dadurch gekennzeichnet, daB das 
75 Gen modifiziert wurde, urn die naturlich auftretende Exonukleaseaktivitat zu eliminieren. 

20. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 19, dadurch gekennzeichnet, da8 ein 
His-Baustein der 3'T5'-Exonukleasedomane der naturlich auftretenden DNA-Polymerase ersetzt oder 
entfernt ist. 

20 

21. Gereinigtes modifiziertes Gen nach einem der AnsprUche 14 bis 20, dadurch gekennzeichnet, daB die 
prozessive modifizierte T7-Typ-DNA-Polymerase eine T7-DNA-Polymerase ist. 

22. Gereinigtes modifiziertes Gen nach Anspruch 21 , dadurch gekennzeichnet, daB His 123 der naturlich 
25 auftretenden T7-DNA-Polymerase ersetzt oder entfernt ist. 

23. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, daB Ser 122 und His 123 
ersetzt oder entfernt sind. 

30 24. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, daB die Aminosaurebaustei- 
ne Lys 118 bis His 123 entfernt sind. 

25. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, daB Lys 118 und Arg 119 
der natQrlich auftretenden T7-DNA-Polymerase ersetzt oder entfernt sind. 

35 

26. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, daB Arg 131, Lys 136, Lys 
140, Lys 144 und Arg 145 der naturlich auftretenden T7-DNA-Polymerase ersetzt oder entfernt sind. 

27. Verwendung einer modifizierten prozessiven DNA-Polymerase, die nach dem Verfahren riach Anspruch 
40 1 hergestellt ist, zum Sequenziereri. 

Revendications 

1. Procede* de production d'une ADN polymerase mbdifiee et purifiee, selon lequel on exprime un gene 
45 modifie\ ce gene codant pour une ADN polymerase progressive modifie*e ayant une activite d f ADN 

polymerase suffisante pour §tre employee dans un sequengage d'ADN lorsque cette polymerase est 
associee a tout co-facteur necessaire a cette activite" d'ADN polymerase, celui-ci resultant de la 
modification d'un gene nature! modifie par remplacement d'un ou plusieurs aminoacides dans le 
domaine d'exonuclease 3'-5* de cette ADN polymerase naturelle, par un aminoacide autre que celui se 
so trouvant naturellement au site de substitution, ou par deletion de ceux-ci de facori a reduire Tactivite 
d'exonuclease naturelle 3'-5* de I'ADN polymerase naturelle. 

2. Procede selon la revendication 1 , caracterise en ce que ('activite de polymerase de I'ADN polymerase 
modifiee, est d'au moins 90 % de celle de I'ADN polymerase naturelle. 

55 

3. Procede selon la revendication 1 ou 2, caracterise en outre en ce que la polymerase est une ADN 
polymerase bacteriophagique de type T7 modifiee, ayant une activite d'exonuclease 3'-5\ au moins 50 
% inferieure a Tactivite d f exonuciease naturelle de I'ADN polymerase de type T7 naturelle. 
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4. Procede selon Tune quelconque des revendications 1 a 3, caracterise en ce que fe gfene code pour une 
ADN polymerase progressive modifiee de fagon a r^duire I'activite d'exonuciease naturelle 3-5', a 
moins de 500 unites par mg de polymerase. 

s 5. Procede selon Tune quelconque des revendications 1 a 4, caracterise en outre en ce que le gene a ete 
modifie de fagon a supprimer I'activite d'exonuciease naturelle de I'ADN polymerase naturelle. 

6. Proced6 selon la revendication 4, dans lequel on remplace ou on supprime un residu His naturel de 
I'ADN polymerase naturelle. 

70 

7. Procede selon Tune quelconque des revendications 4 a 6, caracterise en que I'ADN polymerase 
progressive modifiee de type T7 est une ADN polymerase T7. 

8. Procede selon la revendication 7, caracterise en ce que le residu His 123 de I'ADN polymerase T7 
75 . naturelle, est remplace ou supprime. 

9. Procede selon la revendication 7, caracterise en ce que les residus Ser 122 et His 123 sont remplaces 
ou supprimes. 

20 10. Procede selon la revendication 7, caracterise en ce que les residus derives d'aminoacide Lys 118 a His 
123, sont supprimes. 

11. Procede selon la revendication 7 ou 9, caracterise en ce que les residus Lys 118 et Arg 119 de I'ADN 
polymerase T7 naturelle, sont remplaces ou supprimes. 

25 

12. Procede selon la revendication 7, caracterise en ce que les residus Arg 131, Lys 136, Lys 140, Lys 144 
et Arg 145 de I'ADN polymerase 17 naturelle, sont remplaces ou supprimes. 

13. Procede selon Tune quelconque des revendications 1 a 12, dans lequel I'ADN polymerase progressive 
30 modifiee, est capable de rester liee a de I'ADN sur au moins 500 bases dans les conditions 

normalement employees pour des reactions de sequengage de I'ADN. 

14. Gdne modifie et purifie, caracterise en ce qu'il code pour une ADN polymerase progressive modifiee 
de type T7, cette polymerase etant capable de rester liee a de I'ADN sur au moins 500 bases dans les 

35 conditions normalement employees pour des reactions de sequengage d'ADN, et ay ant une activite 
d'ADN polymerase suffisante pour une utilisation dans un sequengage d'ADN lorsque cette polymerase 
est associee a n'importe quel facteur d'hote necessaire a cette activite d'ADN polymerase, resultant de 
la modification d'un g§ne naturel, modifie de fagon a reduire I'activite d'exonuciease naturelle 3'-5' de 
I'ADN polymerase naturelle, un ou plusieurs aminoacides du domaine d'exonuciease compris dans la 

40 moitie amino-terminale de la polymerase, ou dans le domaine correspondant d'autres ADN polymera- 
ses de type T7, etant remplaces par un aminoacide autre que celui naturellement present au site de 
substitution, ou ceux-ci etant supprimes. 

15. Gene modifie et purifie selon la revendication 14, caracterise en ce qu'un ou plusieurs des aminoacides 
45 du domaine d'exonuciease allant de I'extremite amino-terminale jusqu'au residu derive d'aminoacide 

224 de I'ADN polymerase T7 naturelle, ou du domaine correspondant d'autres ADN polymerases de 
type T7 sont remplaces par un aminoacide autre que celui naturellement present dans le site de 
substitution, ou ils sont supprimes. 

so 16. Gene modifie et purifie selon la revendication 14 ou 15, caracterise en ce que Tactivite de polymerase 
de la polymerase progressive modifiee de type T7 est d'au moins 90 % celle de I'ADN polymerase 
naturelle de type T7. 

17. Gene modifie et purifie selon la revendication 14 ou 15, caracterise en ce qu'en outre la polymerase a 
55 une activite exonuciease, au moins 50 % inferieure a I'activite d'exonuciease naturelle de I'ADN 
polymerase naturelle de type T7. 
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18. G&ne modify et purifie selon Tune quelconque des revendications 14 a 17, caracterise en ce que le 
gene code pour une ADN polymerase progressive et modified de fagon a require ['activity d'exonuciea- 
se naturelle, a moins de 500 unites par mg de polymerase. 

19. Gene modffie* et purifie selon Tune quelconque des revendications 14 a 18, caracterise en outre en ce 
que ce gene a 6x6 modifie de fagon a supprimer I'activite d'exonuciease naturelle de I'ADN polymerase 
naturelle. 

20. Gene modifie et purifie selon Tune quelconque des revendications 14 a 19, caracterise en outre en ce 
qu'un r^sidu His du domaine d'exonuciease 3-5* de CADN polymerase naturelle, est remplace* ou 
supprime\ 

21. Gene modifie ou purifie selon Tune quelconque des revendications 14 a 20 f caracterise en ce que cette 
ADN polymerase progressive modifiee de type T7 est une ADN polymerase de type T7. 

22. G6ne modifie et purifie selon la revendication 21, caracterise en ce que le residu His 123 de I'ADN 
polymerase T7 naturelle est remplace ou supprime. 

23. G5ne modifie et purifie selon la revendication 21, caracterise en ce que les residus Ser 122 et His 123 
sont remplaces ou supprimes. 

24. Gfene modifie et purifie selon la revendication 21, caracterise en ce que les residus derives d'aminoaci- 
de Lys 118 a His 123 sont supprimes. 

25. G^ne modifie et purifie selon la revendication 21, caracterise en ce que les residus Lys 118 et Arg 119 
de I'ADN polymerase T7 naturelle sont remplaces ou supprimes. 

26. Gfene modifie et purifie selon la revendication 21, caracterise en ce que les residus Arg 131, Lys 136, 
Lys 140, Lys 144 et Arg 145 de I'ADN polymerase T7 naturelle, sont remplaces ou supprimes. 

27. Utilisation d'une ADN polymerase progressive modifiee, produite selon le procede de la revendication 
1, pour le sequengage d'ADN. 
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FIGURE 5 
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FIGURE 7 (continued) 



3210 


3220 


3230 


3240 


3250 


ATCCAGTTCG 


ATGTAACCCA 


CTCGTGCACC 


CAACTGATCT 


TCAGCATCT7 


3260 


3270 


3280 


3290 


3300 


TTACTTTCAC 


CAGCG7TTCT 


GGGTGAGCAA 


AAACAGGAAG 


GCAAAATGCC 


3310 


3320 


3330 


3340 


3350 


GCAAAAAAGG 


GAATAAGGGC 


GACACGGAAA 


TGTTGAATAC 


TCATACTCTT 


3360 


3370 


3380 


3390 


3400 


CCTTTTTCAA 


TATTATTGAA 


GCATTTATCA 


GGGTTATTGT 


CTCATGAGCG 


3410 


3420 


3430 


3440 


3450 


GATACATATT 


TGAATGTATT 


TAGAAAAATA 


AACAAATAGG 


GGTTCCGCGC 


3460 


3470 


3480 


3490 


3500 


ACATTTCCCC 


GAAAAGTGCC 


ACCTGACGTC 


TAAGAAACCA 


TTATTATCAT 


3510 


3520 


3530 


3540 


3550 


GACATTAACC 


TATAAAAATA 


GGCGTATCAC 


GAGGCCCTTT 


CGTCTTCAAG 



AA 



36 



EP 0 386 859 B1 



FIGups. 8 



10 


20 


30 


40 


50 


G77GACACAT 


A7GAG7C77G 


7GA7G7AC7G 


GC7GA777C7 


ACGACCAG77 


60 


70 


80 


90 


. 100 


CGCTGACCAG 


TTGCACGAGT 


C7CAA77GGA 


CAAAA7GCCA 


GCAC77CCGG 


110 


120 


130 


140 


150 


CTAAAGGTAA 


CTTGAACCTC 


CG7GACA7C7 


7AGAG7CGGA 


C77CGCG77C 


160 


170 


180 


190 


200 


GCG7AACGCC 


AAATCAATAC 


GAC7CAC7A7 


AGAGGGACAA 


AC7CAAGG7C 


210 


220 


230 


24 0 


250 


ATTCGCAAGA 


GTGGCCTTTA 


7GA77GACC7 


7C77CCGG77 


AA7ACGAC7C 


260 


270 


280 


290 


300 


ACTATAGGAG 


AACCTTAAGG 


777AAC777A 


AGACCC77AA 


G7G77AA77A 


310 


320 


330 


340 


350 


GAGATTTAAA 


TTAAAGAATT 


AC7AAG AGAG 


GAC777AAG7 


A7GCG7AAC7 


360 


370 


380 


390 


400 


7CGAAAAGA7 


GACCAAACGT 


7C7AACCG7A 


A7GC7CG7GA 


C77CGAGGCA 


410 


420 


430 


440 


- 450 


ACCAAAGGTC 


GCAAGTTGAA 


7AAGAC7AAG 


CG7GACCGC7 


C7CACAAGCG 


460 


470 


4 80 


490 


500 


TAGCTGGGAG 


GGTCAGTAAG 


A7GGGACG77 


7A7A7AG7GG 


7AA7C7GGCA 


510 


520 


530 


540 


550 


CCGGATCCGG 


TATGAAGAGA 


77G7TAAG7C 


ACGA7AA7CA 


A7AGGAGAAA 


560 


570 


580 


590 


600 


•TCAA7ATGAT 


CG777C7GAC 


A7CGAAGC7A 


ACGCCC7C77 


AGAGAGCCTC 



37 



EP 0 386 859 B1 
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FIGURE 3 (continued) 
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FIGUKE 9 



10 20 30 40 50 

AATGCTACTA CTATTAGTAG AATTGATGCC ACC7777CAG CTCGCGCCCC 
60 70 80 90 100 

AAATGAAAAT ATAGCTAAAC AGGTTATTGA CCATTTGCGA AATGTATCTA 

HO 120 130 140 150 

ATGGTCAAAC TAAATCTACT CGTTCGCAGA ATTGGGAATC AAC7G77ACA 

160 170 180 190 200 

TGGAA7GAAA CTTCCAGACA CCGTACTTTA GTTGCATATT TAAAACATGT 

210 220 230 240 250 

TGAGCTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA TCCGCAAAAA 

260 270 280 290 300 

TGACCTCTTA TCAAAAGGAG CAATTAAAGG 7AC7C7C7AA TCCTGACCTG 

310 320 330 340 350 

TTGGAGTTTG CTTCCGGTCT GGTTCGCTTT GAAGCTCGAA 77AAAACGCG 

360 370 380 390 400 

ATATTTGAAG TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATCCGCT 

410 420 430 440 450 

. 77GC77C7GA CTATAATAGT CAGGGTAAAG ACCTGATTTT 7GA777A7GG 

4 60 470 480 490 500 

TCATTC7CGT 7T7C7GAAC7 G777AAAGCA 777GAGGGGG A77CAA7GAA 

510 520 530 540 550 

7A777A7GAC GAT7CCGCAG 7A7TGGACGC TA7CCAG7C7 AAACA7777A 

560 570 580 590 600 

C7A77ACCCC C7C7GGCAAA AC77C7777G CAAAAGCC7C 7CGC7A777"- 

610 620 630 640 650 

GG77777A7C G7CG7C7GG7 AAACGAGGGT 7A7GA7AG7G 77GC7C7TAC 

660 670 680 690 700 

7A7GCC7CGT AA7TCC777T GGCGT7A7G7 A7C7GCA77A G77GAA7G75 

710 720 730 740 750 

G7A77CC7AA A7CTCAAC7G A7GAA7C777 C7ACC7G7AA 7AA7G77G77 

760 770 780 790 800 

CCG77AG7TC G7777A77AA CG7AGA7777 7C77CCCAAC G7CCTGAC7G 

810 820 830 840 850 

G7A7AATGAG CCAG77C77A AAA7CGCA7A AGG7AA77CA CAA7GA77AA 

860 870 880 890 <>C0 



40 



EP 0 386 859 B1 
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FIGURE 9 (continued) 
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FIGURE 9 (continued) 
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FIGURE 9 (continued) 
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FIGLmi 9 (continued) 
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FIGURE 9 (continued) 
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